Back

Human Mutation

Wiley

Preprints posted in the last 30 days, ranked by how well they match Human Mutation's content profile, based on 29 papers previously published here. The average preprint has a 0.03% match score for this journal, so anything above that is already an above-average fit.

1
Differential causative effects of germline pathogenic variants in MUTYH and PALB2 in a patient with colorectal polyposis and breast cancer

Camacho Valenzuela, J.; Pelletier, D.; Polak, P.; Fu, L.; Hamel, N.; Domecq, C.; Ahmed, A.; Robles-Espinoza, C. D.; Foulkes, W. D.

2026-05-25 genetic and genomic medicine 10.64898/2026.05.15.26352890 medRxiv
Top 0.1%
17.3%
Show abstract

Purpose Patients carrying Germline Pathogenic Variants (GPVs) in multiple cancer susceptibility genes (CSGs) can be described within the context of Multi-locus Inherited Neoplasia Allele Syndrome (MINAS). The role of each GPV is typically interpreted based on clinical phenotypes. Here, we used tumor sequencing, particularly mutational signatures, to investigate the contribution of GPVs in MUTYH and PALB2 to colorectal polyposis and breast cancer in a single patient at a molecular level. Methods We analyzed tumor sequencing data, including mutational signatures and genomic scars, of a breast tumor and a colorectal polyp from a patient with biallelic GPVs in MUTYH and a heterozygous GPV in PALB2. Results The colorectal polyp showed a dominant contribution of MUTYH-associated Base Excision Repair deficiency (BERd) mutational signatures, with no evidence of Homologous Recombination Repair Deficiency (HRD). In contrast, the breast tumor showed both MUTYH-driven BERd and HRD-associated signatures, including SBS3, ID6 and an elevated HRD score, despite the absence of a detectable second hit in PALB2. These findings suggest a differential contribution from the CSGs, with MUTYH contributing to both lesions and PALB2 contributing specifically to the breast tumor. The observed pattern does not align with the additive or synergistic models described in MINAS. Conclusions Our study provides evidence that mutational signatures can elucidate the contribution of multiple CSGs to tumorigenesis within a single patient. These findings extend current interpretations of MINAS beyond additive or synergistic phenotypes, which may help to better understand tumor etiology, with potential clinical implications, including eligibility for targeted therapies.

2
The D4Z4caster DNA methylation signature identifies individuals at epigenetic risk for developing facioscapulohumeral muscular dystrophy (FSHD)

Jones, T. I.; Eriksen, B. Z.; Farooqi, M. N.; Gould, T.; Jones, P. L.; King, O. D.

2026-05-29 genetics 10.64898/2026.05.26.727947 medRxiv
Top 0.1%
12.8%
Show abstract

BackgroundFacioscapulohumeral muscular dystrophy (FSHD) is caused by epigenetic dysregulation at the chromosome 4q35 D4Z4 repeat array under specific permissive genetic conditions. Due to the complexity, expense, and general inaccessibility of FSHD genetic testing, many individuals displaying characteristic muscle weakness are never genetically confirmed and at-risk relatives cannot get screened. We previously developed a targeted bisulfite sequencing (BSS) protocol using the Sanger method to determine DNA methylation levels at specific D4Z4 loci relevant to distinguishing forms of FSHD from non-FSHD that can be used with DNA isolated from saliva, thereby reducing cost and increasing accessibility compared to traditional D4Z4 deletion testing that uses DNA isolated from blood. MethodsHere, we adapt the D4Z4 BSS protocol to next-generation sequencing (NGS) to increase sequencing depth and further reduce cost, validate both sequencing technologies against several cohorts of genetically defined samples, and introduce the D4Z4caster software for computing DNA methylation signatures with diagnostic utility from raw sequencing data. ResultsBoth Sanger and NGS BSS methods using D4Z4caster were validated as providing high sensitivity and specificity, with geometric mean of sensitivity and specificity (G-mean) >95% and area-under-the ROC curve (AUC) of 0.99. The NGS method allows for higher throughput and increased read depth, while the Sanger method allows faster processing of individual samples. Importantly, the NGS method could identify FSHD1 cases that are likely mosaic and would otherwise be missed. ConclusionsD4Z4caster methylation signatures can accurately detect contracted FSHD1-permissive chromosome 4q35 alleles, hypomethylation of D4Z4 arrays indicative of FSHD2, and SNPs that are important for diagnostic use. This workflow is amenable to transitioning to clinical settings for an accurate, low-cost FSHD molecular diagnostic test that could be accessible worldwide. What is already known on this topicCurrently accepted genetic diagnostics for FSHD1 are complex and expensive and can mischaracterize certain complex genetic cases. These diagnostics all require high molecular weight genomic DNA typically freshly isolated from blood, highly specialized equipment, and additional testing for FSHD2, making FSHD diagnostics the most expensive among neuromuscular diseases and inaccessible to much of the world. However, the epigenetic status of the 4q35 and 10q26 D4Z4 repeat arrays, as determined by DNA methylation status using our bisulfite sequencing-based protocol, distinguishes genetically FSHD1, FSHD2, and non-FSHD samples. Additionally, since our protocol is PCR-based, it can utilize DNA isolated from multiple sources, including saliva and buccal swabs. What this study addsThis study validates the relevant DNA methylation signatures against several large cohorts of genetically-confirmed FSHD and non-FSHD samples and optimizes the DNA methylation data analysis for the greater accuracy required for diagnostic utility, including the exclusion of nonpathogenic chromosome 10q or 4A166 contractions. In addition, we introduce the D4Z4caster analysis software, which runs in a portable and scalable Docker container, and provides increased quantitative accuracy important for: 1) confirming likely clinical cases of FSHD that do not meet the currently accepted genetic definition of FSHD1 or FSHD2, 2) identifying FSHD1 somatic mosaicism, and 3) potential prognostic applications. How this study might affect research, practice or policyFSHD1 is genetically defined by a D4Z4 array at the 4q35 locus that is contracted to 1-10 repeat units. However, disease penetrance is influenced by repeat number, epigenetic modifications, and genetic background, causing a misalignment of current genetic diagnosis with clinical diagnosis. This study will improve the accuracy of epigenetic analysis for determining cases of genetic FSHD, help broaden the definition of genetic FSHD to more accurately correspond to clinical FSHD, and allow identification of those at risk for developing clinical FSHD in affected families and in large population studies now being performed and proposed. In addition, it will better inform how an individuals epigenetic status is interpreted for potential prognostic value. Overall, this methodology is: 1) significantly less expensive than current clinically-approved FSHD diagnostic technologies, 2) more accessible due to compatibility with DNA isolated from multiple sources including saliva, and 3) compatible with the current sequencing equipment and workflow for DNA isolation used in commercial clinical laboratories. Together, these advantages will help move the technology toward becoming an approved molecular diagnostic test for FSHD in the USA, Europe, and countries currently lacking clear access to testing.

3
Cortical Organoid Model of PPP2R5D Genetic Intellectual Disability Models Disease Severity Phenotype

Du, Y.; Singh, M.; Patil, M.; Villeagas, I.; Portillo, A.; Shang, K.; Ben-Shalom, R.; Halmai, J.; Fink, K.

2026-05-27 cell biology 10.64898/2026.05.26.728012 medRxiv
Top 0.1%
7.0%
Show abstract

Jordans Syndrome (JS) is a rare, neurodevelopmental disorder caused by de novo missense mutations in protein phosphatase 2 regulatory subunit Bdelta (PPP2R5D). JS is characterized by severe neurological impairments starting in early life. PPP2R5D encodes for B56{delta}, one of the regulatory subunits of protein phosphatase 2A (PP2A). PP2A is a heterotrimeric protein serine/threonine phosphatase that is highly expressed in the brain and the liver. Past studies have focused on PP2As role in liver and little is known about the holoenzymes behavior in neuronal cells. Although B56{delta} is known to play an important role in the substrate specificity of PP2A, the identification of validated downstream substrates in JS remains unclear. To better understand how the mutations affect neuronal cells, we developed cerebral cortical-like organoids from an engineered allele series of the most common JS mutations to characterize the physiological changes throughout different stages of neurodevelopment. Organoids were assessed for transcriptomic, protein, and electrophysiological changes utilizing bulk RNA sequencing, immunocytochemistry, Western Blot, and high-density MicroElectrode Array. The results identify differentially expressed genes and translated proteins, potential neuronal substrates, and significant electrophysiological signatures that suggest mutations in B56{delta} lead to variant-specific dysfunction of PP2A. Overexpression of PPP2R5D through AAV transduction of organoids rescued several phenotypes in the variants, suggesting different pathogenetic etiology underneath. Our findings successfully characterized cerebral cortical-like organoids in JS cell lines and demonstrated its potential as a model for studying neurodevelopmental disorder and for screening therapeutic approaches.

4
Building an Interoperable Rare Disease Multi-omic Resource: The GREGoR Data Model and Dataset

Heavner, B. D.; Wheeler, M. M.; Bengtsson, J. D.; Carvalho, C. M. B.; Cheung, W. A.; Conomos, M. P.; Delot, E. C.; DiTroia, S.; Ganesh, V. S.; Gogarten, S. M.; Grochowski, C. M.; Jhangiani, S. N.; King, C. H.; LeMaster, C.; Marvin, C. T.; Marwaha, S.; Miller, D. E.; O'Donnell-Luria, A.; Pais, L.; Patterson, K.; Qi, G.; Richardson, M.; Smail, C.; Stilp, A. M.; Tong, C. C.; Ungar, R. A.; Weisburd, B.; Bamshad, M. J.; Bernstein, J. A.; Eichler, E. E.; Gibbs, R. A.; Lupski, J. R.; May, S. J.; Montgomery, S. B.; Pastinen, T.; Posey, J.; Rehm, H. L.; Shojaie, A.; Talkowski, M. E.; Vilain, E.; Wei, C

2026-05-19 genomics 10.64898/2026.05.15.725546 medRxiv
Top 0.1%
6.7%
Show abstract

Rare disease research and diagnosis rely on the integration of genomic and phenotypic data generated across diverse clinical sites; however, the absence of widely adopted standards for representing genomic data and associated metadata has limited data interoperability, reuse, and cross-study analysis. The Genomics Research to Elucidate the Genetics of Rare Diseases (GREGoR) Consortium was established to investigate challenging rare disease cases and evaluate emerging multi-omic technologies for clinical translation. To support coordinated data integration across distributed research sites, we developed a common Consortium Data Model in partnership with domain experts to standardize the capture of participant-, family-, phenotype- and assay-level metadata, with a particular emphasis on using a modular architecture to support linking of multiple data versions from multiple omic technologies to a single individual and attribution of a genetic finding to the specific technology used for its initial discovery. Adoption of the GREGoR Data Model has enabled continued generation and public release of a harmonized, analysis-ready Consortium Dataset. The most recent release includes phenotypic, family and multi-omic data from 12,292 participants in 5,029 families. Other rare disease data sharing efforts are beginning to adopt this data model which will facilitate cross consortium analyses and empower rare disease research. This work demonstrates that a collaborative, flexible, and scalable data model can enable large-scale rare disease research, facilitate cross-center data harmonization, and enable data interoperability.

5
Phenotype-Specific Recalibration of MAVE Data Enables Repurposing of BAP1 Functional Assays for Kury-Isidor Syndrome

Gupta, P.; Balton, E. V.; Tejura, M.; Kumar, R. D.; Snyder, M. W.; Stone, J.; Villani, R. M.; Peter, B. H.; Sirisak, C.; Ian, G. A.; Martha, H.-P.; Danny, M. E.; Jane, R.; Elisabeth, R. A.; Andrew, S. H.; Mark, W.; Undiagnosed Diseases Network (UDN), ; Kathleen, L. A.; Matthew, B. D.; Melissa, M. J.; Gail, J. P.; Katrina, D. M.; Elizabeth, B. E.; Fowler, D. M.; Starita, L. M.; McEwen, A. E.; Stergachis, A. B.

2026-05-21 genetic and genomic medicine 10.64898/2026.05.15.26352805 medRxiv
Top 0.1%
5.3%
Show abstract

Purpose Multiplexed assays of variant effect (MAVEs) are transforming clinical variant interpretation. However, many genes are associated with more than one disease, making it unclear whether functional data generated in one disease context may be directly applicable to another. For example, germline BAP1 missense variants are associated with both BAP1 tumor predisposition syndrome (BAP1-TPDS) and Kury-Isidor syndrome (KURIS), a rare neurodevelopmental disorder. Here, we demonstrate how phenotype-specific calibration of BAP1 MAVE data enables disease-specific variant classification. Methods Saturation genome editing (SGE) data for BAP1 were recalibrated using either BAP1-TPDS- or KURIS-associated missense variants as pathogenic controls. Functional evidence strength was quantified using the Odds of Pathogenicity (OddsPath) framework and mapped to ACMG/AMP PS3/BS3 criteria. Recalibrated functional evidence was integrated with standard clinical criteria for variant classification. A workshop was developed to teach phenotype-specific MAVE recalibration to clinicians and variant curators and evaluated for educational impact. Results Phenotype-specific recalibration using BAP1-TPDS and KURIS controls yielded OddsPath values consistent with PS3_Strong evidence in both contexts. Application of KURIS-specific recalibration enabled the diagnosis of KURIS in an individual with a previously uncertain BAP1 missense variant. The educational workshop enabled quantitatively improved understanding in applying functional evidence. Conclusion Phenotype-specific recalibration enables appropriately calibrated reuse of MAVE datasets across distinct disease contexts, increasing the clinical utility of MAVE datasets and the interpretability of variants in pleiotropic genes. This framework expands the diagnostic utility of existing functional datasets without requiring new experimental assays.

6
Measuring the Meaning of Genomic Results: Harmonization of the Metric for Case-Level Results in the CSER2 Consortium

Powell, B. C.; Amendola, L. M.; Bonini, K. E.; Crosslin, D.; Desrosiers-Battu, L.; Hiatt, S. M.; Hindorff, L.; Kenny, E. E.; Mavura, Y.; Muenzen Ferar, K. D.; Risch, N.; Roman, T.; Slavotinek, A.; Van Ziffle, J.; Bowling, K. M.

2026-06-01 genetic and genomic medicine 10.64898/2026.05.28.26354388 medRxiv
Top 0.2%
3.7%
Show abstract

Yield of reported results from genetic testing provides a proximal measure of clinical usefulness. While ACMG/AMP guidelines provide representations of uncertainty for individual genetic variant classification, additional factors are considered when determining whether results explain a patient's presentation. To standardize cross-consortium analysis, a working group of the Clinical Sequencing Evidence-Generating Research (CSER2) consortium iteratively identified factors used when contextualizing variant-level results to case-level interpretation (i.e., interpretation of an individual's genetic data with respect to the indication for testing). Sites independently categorized results; complex cases were discussed collaboratively, leading to revision of classification categories. Our metric incorporates factors beyond classification of reported variants. Analogous to variant-level results, "Definitive Positive" and "Probable Positive" represent certainty that results may be clinically explanatory. The category "Inconclusive" applies when results may or may not fully explain the patient presentation, with subdivision into multiple (non-exclusive) subcategories. Cases falling outside all of the other categories are considered "Negative". The overall diagnostic yield by this metric and use of categories for inconclusive results varied by CSER project, in part paralleling study design differences. This case-level categorization provides a meaningful assessment of diagnostic yield, and for inconclusive cases identifies potentially resolvable factors for case resolution.

7
Integrating enriched case data from national laboratory testing with population-based case-control analyses: a novel statistical likelihood-ratio methodology for PS4 applied to 325,345 breast cancer cases and 671,006 controls

Allen, S.; Rowlands, C. F.; Garrett, A.; Couch, F.; Richardson, M. E.; Pesaran, T.; Pethick, J.; Lavelle, K.; McRonald, F.; Vernon, S.; Torr, B.; Loong, L.; Aungraheeta, R.; Durkie, M.; Burghel, G. J.; Callaway, A.; Robinson, R.; Field, J.; Frugtniet, B.; Palmer-Smith, S.; Grant, J.; Pagan, J.; McDevitt, T.; Snape, K.; Hanson, H.; McVeigh, T.; Loveday, C.; Jones, M.; Hardy, S.; Turnbull, C.; CanVIG-UK,

2026-05-17 genetic and genomic medicine 10.64898/2026.05.13.26353095 medRxiv
Top 0.2%
3.7%
Show abstract

Background: For many evidence criteria within v3.0 of the ACMG/AMP guidelines, methodologies have been developed to empower their use outside the stipulated evidence strengths. However, no such methodology has been established for case-control data (PS4). With the release of large-scale unselected case-control datasets and expansion of nationally-collected laboratory datasets enriched for pathogenic variant carriers, there is potential to combine datasets across ascertainment contexts in a more quantitative manner using novel likelihood ratio tools. Methods: Using our published PS4-LR-Calculator, we calculated a combined log likelihood ratio (PS4-LLR) across five datasets (three unselected, and two enriched), and estimated enrichment of pathogenic variants in clinically-ascertained laboratory data using truncating variant prevalence. Results: Data were combined for 10,817 missense variants from 325,345 female breast cancer patients and 671,006 controls of Western European ancestry for five breast cancer susceptibility genes (BRCA1, BRCA2, PALB2, ATM, CHEK2). A combined LLR was produced for 4,690 missense variants; 927 variants received evidence towards pathogenicity (LLR[≥]1), and 3,242 received evidence towards benignity (LLR[≤]-1). Conclusion: This flexible, variant-level methodology combines nationally-collected 'enriched' datasets with unselected case-control cohorts, expanding the available information for case-control analysis, boosting power, enabling exploration of atypical penetrance and empowering variant classification.

8
Whole-exome-based preconception carrier screening in Uzbekistan with targeted SMA, FMR1, and DMD assays: the first reported clinical program

Kullyev, A.; Avdeichik, S.; Akimenkova, A.; Kartuesov, A.; Kardymon, O.; Goikhman, Y.

2026-06-04 genetic and genomic medicine 10.64898/2026.06.02.26354713 medRxiv
Top 0.2%
3.7%
Show abstract

Abstract Purpose: Published clinical outcome data on preconception carrier screening (PCS) in Central Asia are limited. We report the first clinical implementation study from Uzbekistan of a whole-exome sequencing (WES)-based multi-platform PCS program combining exome sequencing with targeted SMA, FMR1, and DMD assays. Methods: We retrospectively analyzed anonymized data from 65 individuals (19 couples, 27 singletons) screened at IMC Genomics, Tashkent, between January 2024 and May 2026. WES covering the protein-coding regions of approximately 20,000 genes was followed by exome-wide bioinformatics filtering and clinical geneticist interpretation. Partly overlapping cohorts underwent SMA carrier screening (n=179), FMR1 CGG-repeat analysis in females (n=155), and DMD deletion/duplication testing in preconception females (n=29). Variants were classified by ACMG/AMP criteria against gnomAD v4.1. Results: Sixty-one of 65 WES-screened individuals (93.8%; 95% CI 85.2 - 97.6%) carried at least one reportable variant (152 instances across 126 genes). Four of 19 couples (21.1%; 95% CI 8.5 - 43.3%) were concordant for pathogenic or likely pathogenic variants in the same autosomal recessive gene; two were referred for preimplantation genetic testing for monogenic disease. SMA screening identified four carriers, including two 2+0 silent carriers; FMR1 analysis identified one intermediate allele; DMD MLPA identified no exonic rearrangements. Conclusion: This first reported WES-based multi-platform PCS program in Uzbekistan was feasible and clinically informative, identifying actionable couple-level reproductive risks and supporting structured implementation of reproductive genetic screening in Central Asia.

9
Rare neurological and neurodevelopmental variants in ALS link to onset, survival and family history

O'Donoghue, C.; Kacar, E.; Gomes, T.; Costello, E.; Pender, N.; Peelo, C.; Ryan, M.; Heverin, M.; Byrne, S.; Bede, P.; Hardiman, O.; McLaughlin, R. L.; Byrne, R. P.

2026-06-10 genetic and genomic medicine 10.64898/2026.06.09.26354977 medRxiv
Top 0.2%
3.6%
Show abstract

Background: Neurological, neuropsychiatric, and neurodevelopmental disorders cluster in ALS families, sharing a common genetic architecture with ALS. Pathogenic variants in genes associated with other neurological, neurodevelopmental, or neuropsychiatric disorders may also co-occur in ALS and modify phenotype. We have sought to determine the prevalence and clinical pattern of likely-pathogenic/pathogenic (LP/P) non-ALS neurological, neurodevelopmental, and neuropsychiatric variants, alone and in combination with ALS-gene variants, in two large ALS cohorts. Methods: Whole-genome sequencing (WGS) of 469 Irish and 774 Answer ALS people with ALS (pwALS) was analysed for ClinVar LP/P variants associated with other neurological (n = 15541), neurodevelopmental (n = 9761), and neuropsychiatric (n = 321) phenotypes. Inheritance patterns for associated genes (autosomal recessive/autosomal dominant) along with the associated phenotype were validated using OMIM. Standardised clinical data included family history, site and age of onset, El Escorial category, survival, motor decline, and cognitive and behavioural assessments. Known ALS-gene variants and C9orf72 repeat expansion status were included for each cohort. Results: Non-ALS neurological variants were identified in 47/469 (10.0%) Irish and 69/774 (8.9%) Answer ALS participants, most frequently in hereditary spastic paraplegia-associated genes (3.2% Irish; 2.8% Answer ALS). Irish neurological variant carriers showed higher frequency of respiratory onset (10.6% vs 1.2%, Fisher's exact p = 0.002, {Phi} = 0.20) and fewer premorbid behavioural symptoms (0.92 +/- 0.56 vs 3.08 +/- 0.97, Cohen's d = -0.40). Neurodevelopmental variants occurred in 12/469 (2.6%) Irish and 20/774 (2.6%) Answer ALS participants. In the Irish cohort, neurodevelopmental variant carriers had significantly shorter survival in Cox proportional hazards model (log-rank p = 0.005), corresponding to a more than two-fold increased hazard of death (HR = 2.25, 95% CI 1.26-4.00), and had significantly increased familial burden of neuropsychiatric disorders among first- and second-degree relatives (negative binomial IRR for carriers = 2.41, 95% CI: 1.12-5.18, p = 0.025). Across combined cohorts, 18 individuals (Irish n = 8; Answer ALS n = 10) carried [≥]2 LP/P variants spanning ALS and non-ALS genes. Conclusion: Rare LP/P variants in genes associated with other neurological and neurodevelopmental disorders occur in up to 12% of pwALS across two independent cohorts. Carriers show distinct phenotypes, shorter survival, and characteristic family history patterns. These findings suggest that extended pleiotropic and oligogenic architectures may contribute to ALS heterogeneity.

10
The Genetic Landscape and Epidemiological Characteristics of Inherited Retinal Diseases in the Chinese Population

Zeng, B.; Cui, Z.; Zhou, S.; Dai, W.

2026-05-29 ophthalmology 10.64898/2026.05.27.26354224 medRxiv
Top 0.2%
3.5%
Show abstract

Background: Inherited Retinal Diseases (IRDs) are a group of genetically heterogeneous blinding conditions. Major global genomic reference databases are disproportionately enriched for individuals of European ancestry. This underrepresentation creates a significant bias that impedes the accuracy of genetic diagnosis in the Chinese population. This study aims to address this limitation by constructing a comprehensive genetic landscape of IRDs using large-scale deep-sequencing data from a large Chinese cohort. Methods: The study leveraged variant data primarily from 10,588 individuals in the China Metabolic Analytics Project (ChinaMAP) and cross-referenced findings against multiple national and international databases. We systematically curated variants within a targeted panel of 291 IRD-associated genes. Variant pathogenicity was assessed using a comprehensive pipeline integrating InterVar-automated classification based on 2015 American College of Medical Genetics and Genomics/Association for Molecular Pathology (ACMG/AMP) guidelines, ClinVar evidence (review status [≥] 1 star), and manual literature curation. We delineated the mutational spectrum, identified population-enriched pathogenic/likely pathogenic (P/LP) variants, and analyzed the distribution characteristics of IRD-associated highly-mutated genes. Furthermore, we calculated the carrier frequencies (CF) and genetic prevalence (GP) of autosomal recessive(AR)-IRD genes in the Chinese population. Results: The study revealed a highly concentrated genetic landscape for AR-IRDs in the Chinese population, with ABCA4 and USH2A emerging as the primary drivers of the genetic burden. This finding aligns with previous Chinese cohorts but contrasts with global databases, where genes such as the X-linked RPGR are more prevalent. In contrast, autosomal dominant (AD)-IRDs exhibited high locus heterogeneity, with pathogenic variants dispersed across numerous genes (e.g., COL2A1 and MFN2). We identified a series of P/LP variants that were either high-frequency or significantly enriched in the Chinese population, such as CNGB1 (p.P530R) and specific recurrent alleles in ABCA4 and CYP4V2. The estimated cumulative CF for AR-IRDs was 1 in 5.60, and the theoretical total GP was 1 in 2,624.67, based on the ChinaMAP data. Conclusion: By integrating the ChinaMAP dataset with diverse genomic resources, this study provides a genetic landscape of IRDs in the Chinese population. Our analysis shows a concentrated mutational spectrum in AR-IRDs, contrasting with the pronounced heterogeneity in AD-IRDs. These findings, including population-specific pathogenic variants and refined prevalence estimates, provide a resource for precision diagnostics, genetic counseling, expanded carrier screening (ECS), and public health policy development in China.

11
Stratified evaluation of blood RNA sequencing in a rare disease cohort

Duzenli, T.; Durmus, S.; Kaya, H. E.; Sevilgen, F. E.; Kayhan, G.; Cakir, T.; Ergun, M. A.

2026-05-28 genetic and genomic medicine 10.64898/2026.05.27.26353804 medRxiv
Top 0.2%
3.5%
Show abstract

Background: RNA sequencing (RNA-seq) is increasingly recognized as a complementary tool to DNA-based sequencing for improving the diagnostic yield in Mendelian disorders. However, how the diagnostic performance of RNA-seq varies across molecularly and phenotypically distinct patient subgroups remains poorly defined. This study aimed to evaluate and compare the diagnostic utility of RNA-seq across three stratified groups of patients with non-diagnostic exome sequencing. Methods: We performed RNA-seq on whole blood samples from 90 patients with suspected Mendelian disease in whom clinical exome or whole-exome sequencing had failed to establish a molecular diagnosis. Patients were prospectively stratified into three groups of 30: (i) patients with a candidate variant of uncertain significance (VUS) with predicted splicing impact (Group 1), (ii) patients with a specific clinical pre-diagnosis but no identified pathogenic variant (Group 2), and (iii) patients without a specific pre-diagnosis or candidate variant (Group 3). Aberrant splicing, gene expression outliers, and allele-specific expression were analyzed using multiple bioinformatic tools and compared against a GTEx-derived control cohort. Results: RNA-seq contributed to a molecular diagnosis in 29 of 88 evaluable patients (32.9%). Diagnostic yield differed substantially across groups: 82.8% (24/29) in Group 1, 6.9% (2/29) in Group 2, and 10% (3/30) in Group 3. In Group 1, RNA-seq enabled reclassification of candidate VUS through direct demonstration of aberrant splicing events. In Group 2, RNA-seq identified a somatic mosaic ACTB variant missed by exome sequencing and reclassified a previously deprioritized APPL1 VUS. In Group 3, a deep intronic pseudoexon-activating variant in IGBP1 was identified in two siblings with severe microcephaly, providing evidence for a candidate X-linked microcephaly gene, and a pathogenic RNU4-2 variant was detected in a patient with ReNU syndrome, a non-protein-coding gene not captured by standard exome sequencing. Conclusions: RNA-seq has the highest diagnostic utility when applied to evaluate candidate splice variants identified by prior DNA testing but also provides independent diagnostic value in patients without candidate variants. The systematic comparison across stratified patient groups supports the integration of RNA-seq into clinical genomic workflows and highlights the need for standardized analytic frameworks.

12
Dried blood spot proteomics as a diagnostic framework for citrin deficiency

Totsune, E.; Nakajima, D.; Konno, R.; Mikami-Saito, Y.; Arai-Ichinoi, N.; Nishida, H.; Yagi, H.; Ishige, T.; Suzuki, H.; Shirota, M.; Takayama, J.; Takano-Asai, C.; Shimura, M.; Sasai, H.; Lee, T.; Kido, J.; Nakajima, Y.; Kobayashi, H.; Kikuchi, A.; Numakura, C.; Hamazaki, T.; Oishi, K.; Nakamura, K.; Kawashima, Y.; Ohara, O.; Wada, Y.

2026-05-28 genetic and genomic medicine 10.64898/2026.05.26.26354012 medRxiv
Top 0.2%
3.2%
Show abstract

Background: Citrin deficiency, caused by biallelic pathogenic variants in SLC25A13, must be identified early to prevent serious complications such as hyperammonemia and liver failure. However, clinical diagnosis is often delayed due to its nonspecific presentation and limited sensitivity of amino acid-based newborn screening methods. Although genome-based evaluations are being investigated to address these issues, concerns about their cost, turnaround time, variant interpretation ability, and data handling highlight the need for a more practical yet reliable alternative. We investigated the feasibility of applying proteomic approach on dried blood spots (DBS), which are routinely used in newborn screening. Methods: We performed untargeted liquid chromatography-tandem mass spectrometry to analyze the proteome of DBS using a previously developed "non-targeted analysis of non-specifically DBS-absorbed proteins" (NANDA) workflow. SLC25A13 protein abundance was quantified in individuals with biallelic loss-of-function mutations, compound loss-of-function/missense mutations, and heterozygous carriers; this was also evaluated in healthy and diseased controls representing relevant differential diagnoses. To leverage proteomic information, we derived a multivariate proteomic signature using feature selection and evaluated its performance with leave-one-out cross-validation. Biological relevance was assessed by enrichment analysis, and complementary transcriptomics was performed using RNA sequencing. Results: A total of 7,474 proteins, including SLC25A13, were consistently detected in DBS. SLC25A13 was undetectable in individuals with biallelic loss-of-function mutations. However, individuals with compound loss-of-function/missense genotypes showed reduced but measurable SLC25A13 levels, comparable to those observed in heterozygous carriers. In contrast, a compact 15-protein signature accurately identified individuals with compound loss-of-function/missense genotypes (AUC, 0.99; sensitivity, 1.00; specificity, 0.95). The signature was enriched for Ca2+-response, and transcriptomics showed downregulation of genes related to multimodal ion channels in affected individuals compared to controls. Conclusions: DBS-based proteomic profiling may assist in the diagnosis of citrin deficiency through SLC25A13-quantification and a biologically plausible multivariate signature. More broadly, this strategy offers a promising new diagnostic layer for protein disorders, providing a proteomic readout in a clinically practical DBS format with potential utility for future diagnostic and screening applications.

13
Intra-Abdominal Bowel Dilation in Experimental Gastroschisis is Associated with a Modifiable Transcriptomic Program of Intestinal Dysfunction

Guerra, M. E.; Arai, T.; Joyeux, L.; Baxter, C. C.; Bose, S.; Thevasagayampillai, S.; Li, H.; Yu, L.; Akondy, V.; Scuglia, M.; Basurto, D.; Van den Eede, E.; Vergote, S.; Watananirum, K.; Tianthong, W.; Russo, F.; De Coppi, P.; Gunaratne, P. H.; Cheng, L. S.; Belfort, M. A.; Balaji, S.; Deprest, J.; Keswani, S. G.

2026-06-02 developmental biology 10.64898/2026.05.29.728798 medRxiv
Top 0.2%
3.1%
Show abstract

STRUCTURED ABSTRACTO_ST_ABSObjectiveC_ST_ABSTo characterize intestinal transcriptional profiles in gastroschisis, their temporal evolution, and response to fetal intervention. Summary Background DataGastroschisis causes significant intestinal dysfunction, with intra-abdominal bowel dilation clinically shown to correlate with worse outcomes. While inflammation and neurovascular impairment have been implicated, genome-wide transcriptional characterization of disease severity remains lacking. MethodsUsing a fetal ovine model of complex gastroschisis, in which all gastroschisis animals demonstrated significant intra-abdominal bowel dilation at term, bulk RNA sequencing was performed on proximal small intestinal tissue from mid-gestation and term fetuses across three groups: normal, gastroschisis, and prenatally repaired gastroschisis. Differential gene expression (FDR [≤] .05, |log2 fold change| [≥] 1.5) and pathway enrichment analyses were performed, with targeted interrogation of extracellular matrix (ECM), enteric nervous system (ENS), angiogenic, and inflammatory pathways. ResultsAt mid-gestation, gastroschisis intestine showed minimal transcriptional differences (150 differentially expressed genes [DEGs]) and some bowel dilation. By term, dysregulation was substantial (2,423 DEGs) alongside significant dilation. Normal ontogenetic intestinal maturation patterns were altered, with fewer expected developmental gene changes and discordant pathway regulation. ECM pathway aberrations emerged early and persisted, while ENS, angiogenic, and inflammatory pathways were only dysregulated at term. Fetal repair was associated with normalization of gene expression at term (29 DEGs vs controls). ConclusionIntestinal transcriptional changes in experimental gastroschisis parallel progressive bowel dilation, consistent with a mechanical stress contribution to intestinal injury. Prenatal repair normalizes both dilation and gene expression, indicating a dynamic and potentially modifiable transcriptional program that supports the rationale for early fetal intervention. Mini AbstractIn a fetal ovine model, progressive bowel dilation in gastroschisis parallels transcriptomic dysregulation of ECM remodeling, neurovascular impairment, and inflammation which is normalized by prenatal repair.

14
EYA1/EYA2 and EYA3/EYA4 act as stage-specific SIX cofactors in embryonic and adult regenerative skeletal myogenesis

Viaut, C.; Wurmser, M.; Jauliac, E.; Ben Driss, L.; Backer, S.; Madani, R.; Issa, F.; PIROZHKOVA, I.; Sotiropoulos, A.; Amthor, H.; Maire, P.

2026-05-22 developmental biology 10.64898/2026.05.20.726470 medRxiv
Top 0.2%
2.6%
Show abstract

Eya3 and Eya4 are two Eya genes expressed in adult myogenic stem cells, where they may act as SIX cofactors. We analyzed muscle regeneration in single and compound Eya3 and satellite cell-specific Eya4 mutant mice. A kinetic analysis of muscle regeneration after Notexin injury of the Tibialis Anterior revealed no major phenotype at 4, 14, and 30 days after injury in terms of PAX7+ cell number and myofiber cross-sectional area in Eya3 mutants, while all parameters were decreased in Eya4 mutants and further worsened in Eya3/Eya4 double mutants, in which we also observed a modification of the myofiber phenotype at 30 days after injury. Satellite cells were cultured ex vivo and Eya4 deletion was induced by Ad-Cre-mediated recombination. While single Eya3 mutant cells showed normal proliferation and differentiation, double mutant cells exhibited normal proliferation but failed to fuse. Analysis of their transcriptome revealed that the expression of Myomixer, Follistatin, and Noggin was severely downregulated specifically in double mutant cells, explaining their fusion deficiency. To gain a better understanding of the involvement of Eya genes during embryonic development and the genesis of PAX7+ myogenic stem cells, we analyzed Eya1 / ;Eya2 / , Eya3 / , Eya4 / , and Eya3 / ;Eya4 / E18.5 mutant fetuses at the limb and craniofacial levels. In Eya1 / ;Eya2 / fetuses, we confirmed the absence of distal limb muscles and observed reduced craniofacial muscles. In Eya3 / ;Eya4 / fetuses, craniofacial myogenesis appeared preserved and PAX7+ myogenic stem cells were present. BackgroundThe Eyes absent (Eya) genes encode transcriptional co-activators and phosphatases that function within the PAX-SIX-EYA-DACH (PSED) regulatory network. In skeletal muscle, EYA proteins cooperate with SIX homeoproteins to control myogenic gene expression during both embryonic development and adult regeneration. While Eya1 and Eya2 are predominantly expressed in embryonic myogenic progenitors and Eya3 and Eya4 are the dominant paralogs in adult satellite cells (SC), the specific and redundant contributions of individual family members to myogenesis remain poorly characterized. MethodsWe analyzed compound Eya mutant mice during adult Tibialis anterior muscle regeneration and during embryogenesis. We complemented this analysis by performing ex vivo myogenic stem cell cultures from compound Eya mutants and examining their fusion capacity. ResultsAnalysis of muscle regeneration following Notexin injury revealed that Eya2 and Eya3 single mutants display no major regenerative deficit. In contrast, satellite cell-specific deletion of Eya4 (Eya4sc/sc) caused a transient impairment of early regeneration, with reduced numbers of smaller regenerating MYH3+ (embryonic myosin heavy chain) myofibers and a transient decrease in SC number at 4 days post-injury (dpi). Compound Eya3-/-;Eya4sc/scdouble mutants showed a more severe and persistent phenotype, with decreased myofiber cross-sectional area, reduced myonuclear accretion, accumulation of PAX7+ cells associated with regenerated myofibers, and altered fiber-type composition at 14 and 30 dpi. Ex vivo analysis of double mutant SCs revealed a specific and complete blockade of myogenic fusion without defects in proliferation or MYOD expression. Transcriptomic analysis identified severe downregulation of Myomixer, Noggin, and Follistatin in differentiating Eya3-/-;Eya4-/- SCs. Open-access SIX1 and SIX4 ChIP-seq publicly available data confirmed direct binding at the Myomixer, Noggin, and Follistatin loci, supporting a direct SIX-EYA transcriptional mechanism. In parallel, embryonic analysis demonstrated that Eya1-/-;Eya2-/-E18.5 fetuses lack distal limb musculature and display severe craniofacial muscle hypoplasia, while in Eya3-/-;Eya4-/-fetuses limb and craniofacial musculature developed with no detectable defects. ConclusionsThese results reveal distinct temporal requirements for EYA proteins in skeletal muscle: EYA1 and EYA2 are essential SIX cofactors for embryonic myogenic fate acquisition in hypaxial and craniofacial progenitors, while EYA3 and EYA4 act redundantly in adult satellite cells to enable myogenic fusion by maintaining BMP antagonist expression and Myomixer activation downstream of the SIX-EYA transcriptional complex.

15
In vitro splice-switching oligonucleotide rescues aberrant GFM2 pseudoexon inclusion and restores mitochondrial activity

Gross, S.; Birnbaum, R.; Shaul Lotan, N.; Mor-Shaked, H.; Manor, J.; Shaag, A.; Rosenbluh, C.; Levy-Memo, A.; Yanovsky-Dagan, S.; Saada, A.; Harel, T.

2026-06-01 genetic and genomic medicine 10.64898/2026.05.28.26354078 medRxiv
Top 0.2%
2.4%
Show abstract

Background: Biallelic variants in GFM2, encoding mitochondrial elongation factor G2 (mtEFG2), a GTPase involved in the termination stage of mitochondrial translation, cause autosomal recessive combined oxidative phosphorylation deficiency. Noncoding structural variants may be missed by exome sequencing but can disrupt splicing and provide opportunities for variant-specific therapeutic rescue. We investigated the molecular mechanism underlying suspected Leigh syndrome in an infant with mitochondrial disease and evaluated whether splice-switching oligonucleotide (SSO) treatment could correct the pathogenic splicing defect. Methods: The proband underwent exome sequencing followed by short-read and long-read whole genome sequencing. RNA sequencing, reverse-transcription PCR, quantitative PCR, and cycloheximide treatment were used to characterize the effect of the identified intronic duplication on GFM2 splicing and transcript stability. Patient-derived fibroblasts were treated with SSOs targeting the aberrant splice junction. Rescue was assessed by RNA studies, western blotting, and spectrophotometric measurement of cytochrome c oxidase (COX). Results: Whole genome sequencing identified a paternally-inherited GFM2 missense variant, NM_032380.5:c.2195C>T p.(Pro732Leu), in trans to a maternally-inherited 221-nucleotide intronic duplication, NM_032380.5:c.2029-741_2029-521dup. RNA studies revealed a 87-nucleotide pseudoexon, generated by activation of a cryptic acceptor splice site within the duplicated sequence. The resulting transcript harbored a premature termination codon (PTC) and underwent nonsense-mediated decay, as confirmed by cycloheximide rescue. Together with reduced mtEFG2 protein levels on western blot, the findings supported a loss-of-function mechanism. Enzymatic analysis of affected fibroblasts showed reduced activity of the mtDNA-dependent complex IV subunit COX, with preservation of the nuclear-encoded complex II enzyme succinate dehydrogenase and the control enzyme citrate synthase, consistent with impaired mitochondrial translation. A SSO targeting the aberrant intron-pseudoexon junction nearly abolished pseudoexon inclusion, restored correctly spliced GFM2 transcript from the duplication-containing allele, increased mtEFG2 protein levels, and significantly improved COX activity. Conclusions: This study identifies a pathogenic intronic GFM2 duplication that causes mitochondrial disease through pseudoexon activation and nonsense-mediated decay. The findings demonstrate the value of integrated genome and transcriptome analysis for exome-negative mitochondrial disease and provide in-vitro proof of concept that SSOs can restore transcript processing, protein expression, and mitochondrial respiratory-chain function in patient-derived cells.

16
Multisite Evaluation of an Amplification-based Nanopore Sequencing Solution to Analyze Challenging Clinically Relevant Variants in Genes Associated with Hereditary Diseases

Filipovic-Sadic, S.; Parker, C. A.; Mihailovic, M. K.; Milligan, J. N.; Turner, J. M.; Borel, S. L.; Le, V.; Markulin, T.; Janovsky, J. W.; Killinger, B. J.; Deshotel, M. J.; Reading, N. S.; Fredrickson, E. K.; Ji, Y.; Close, D.; Wright, J.; Williams, M.; Barrie, E. S.; Martin, K. E.; Gray, S. M.; Haynes, B. C.; Hall, B.

2026-05-19 genetics 10.64898/2026.05.14.725224 medRxiv
Top 0.3%
2.1%
Show abstract

PurposeCarrier screening for hereditary conditions is challenged by genes with complex genomic architecture, where short-read sequencing can fail to detect clinically relevant variants. This study evaluated a unified, amplification-based nanopore sequencing workflow across multiple laboratories for comprehensive analysis of such loci. MethodsA modular long-read sequencing assay was evaluated across five laboratories using targeted PCR enrichment, Oxford Nanopore sequencing, and automated variant analysis. The workflow interrogated genes associated with spinal muscular atrophy, thalassemia, cystic fibrosis, fragile X syndrome, congenital adrenal hyperplasia, Gaucher disease, and hemophilia A. Performance was assessed against orthogonal methods for single nucleotide variants (SNVs), indels, copy-number variants, repeat expansions, and structural rearrangements. ResultsAcross 882 unique samples (1,266 tests), overall agreement with comparator methods exceeded 96% for variant-level detection and 97% for genotype status classification. Long-read sequencing enabled phasing of paralogous loci, integrated sizing and interruption analysis for FMR1 repeats, and simultaneous detection of SNVs and structural variants in globin loci and CYP21A2-TNXB region, reducing reliance on multiple workflows. ConclusionThis multisite evaluation suggests that targeted long-read sequencing can consolidate complex variant detection into a single workflow, improving analytical completeness and operational efficiency for carrier screening.

17
Celiac Disease Risk Allele Frequencies in San Luis (Argentina) and Evaluation of a Saliva Direct PCR Genotyping Approach

Perez, C. N.; Pistone, C.; Romero, C.; Carrillo, A.; Manzur, M. J.; Chialva, C.; Quiroz, H.; Juri Ayub, M.

2026-05-21 genetic and genomic medicine 10.64898/2026.05.19.26353109 medRxiv
Top 0.3%
2.1%
Show abstract

Celiac disease (CD) is strongly associated with specific HLA DQ heterodimers, formed by HLA DQA1 and HLA DQB1 proteins. In particular DQ2.5 (DQB1*02 associated to DQA1*05) and DQ8 (DQB1*03:02 with DQA1*03) are present in virtually all celiac patients. HLA DQB1*02 is considered the main single genetic susceptibility marker and has been reported in 90 to 95% of CD patients. However, the distribution of these alleles may vary across populations, potentially impacting the performance of genetic screening strategies. In this study, we evaluated the prevalence of HLA DQ2.5 and DQ8 genotypes in celiac patients (n = 41) and an unbiased general population cohort (n = 60) from San Luis, Argentina, using a PCR-based genotyping approach. In addition, we assessed the feasibility of a simplified saliva direct PCR protocol for large scale testing. Overall, 95.1% of CD patients carried DQ2.5 and/or DQ8. Notably, 41.5% of patients were DQ8(+)/DQ2.5(-), and 36.6% lacked the DQB1*02 allele, indicating that DQB1*02 based screening alone would have reduced sensitivity in this population. In the general population, 53.3% of individuals carried CD associated genotypes, with a markedly higher prevalence of DQ8 compared to European cohorts. Genotype distributions deviated from Hardy Weinberg equilibrium in CD patients but not in the general population. We show that DQB1*03:02 is a reliable proxy for DQ8, allowing simplification of genotyping strategies, whereas DQA1*05 typing remains essential to discriminate DQ2.5 from other lower risk DQB1*02 carrying heterodimers. We also describe a saliva direct PCR approach showing a performance comparable to purified DNA based assays. These findings highlight the importance of population specific genetic data for optimizing CD screening strategies and foster the development of simplified, cost effective genotyping approaches for large scale applications.

18
Prevalence and Clinical Significance of Adult-Onset Cancer Predisposition Variants in Pediatric Oncology

Maciaszek, J. L.; Pastor Loyola, V.; Cain, T.; Cardenas, M.; Blackburn, P. R.; Wilkinson, M. R.; Koo, S. C.; Wu, C.-H.; Li, C.; Wang, L.; Nichols, K. E.; Klco, J. M.; Eldomery, M. K.

2026-06-08 genetic and genomic medicine 10.64898/2026.06.07.26354365 medRxiv
Top 0.3%
2.1%
Show abstract

Purpose: Pathogenic or likely pathogenic (P/LP) variants are increasingly identified in genes more commonly associated with adult-onset cancer predisposition, but their prevalence and relevance to children who present with cancer remain unclear. Methods: We retrospectively analyzed 1,280 consecutive pediatric patients with cancer who underwent clinical germline sequencing, using a virtual panel, from 2021 to 2024. Genes with P/LP variants were categorized as aoCPG or pediatric-onset cancer predisposition genes (poCPG) according to cancer risk before age 18 years and pediatric surveillance recommendations. Variant relevance was adjudicated using tumor diagnosis/histopathology, immunohistochemistry, and tumor molecular features and classified as primary, secondary, or indeterminate. Results: Among 1,280 patients, 197 (15.4%) harbored 211 P/LP variants across 54 genes. Sixty-six variants (31.3%) occurred in aoCPG, 87 (41.2%) in poCPG, and 58 (27.5%) were heterozygous variants in autosomal recessive genes. Among adult-onset variants, 7 (10.6%) were primary, 54 (81.8%) secondary, and 5 (7.6%) indeterminate. Among pediatric-onset variants, 77 (88.5%) were primary and 10 (11.5%) secondary. Six patients (3 adult-onset variants; 3 pediatric-onset variants) received targeted therapy informed by germline/somatic sequencing results. Conclusion: In pediatric oncology, most variants in aoCPG are secondary rather than tumor-related findings. Tumor-informed interpretation, beyond variant classification, may improve reporting, counseling, and therapeutic decision-making

19
Automated Versus Manual Reanalysis In Rare Disease Genomics

Kaschta, D.; Arriens, V.; Mueller, S.; Utermann-Thuesing, C.; Vater, I.; Caliebe, A.; Nagel, I.; Spielmann, M.

2026-05-19 genetic and genomic medicine 10.64898/2026.05.16.26352295 medRxiv
Top 0.3%
2.0%
Show abstract

Purpose. Periodic reanalysis of genome sequencing data can yield additional diagnoses as knowledge evolves, yet manual reanalysis is labour-intensive. We compared automated and manual reanalysis approaches in rare disease genomics. Methods. We reanalyzed 377 rare disease cases: 158 with pathogenic or likely pathogenic (P/LP) findings, 49 with variants of uncertain significance (VUS) findings, and 170 had no findings. Manual reanalysis used standard diagnostic workflow for all cases without prior P/LP diagnoses (219 cases). An automated pipeline using Talos was benchmarked on the 158 P/LP cases before application to the 219-case reanalysis cohort. The mean reanalysis interval was 660 days. Results. Manual reanalysis identified three additional P/LP cases and two newly classified as VUS, increasing P/LP cases from 158 (41.9%) to 161 (42.7%). Talos recovered all three P/LP findings but only identified one of the two new VUS findings. Benchmarking showed 80.0% singleton concordance and 75.2% (82.8% proband-only) trio concordance, with approximately three variants per case. Conclusion. Reanalysis at 1.8 years yields modest but clinically meaning- ful gain. Automated reanalysis closely approximates manual performance while reducing hands-on effort, supporting scalable reanalysis in routine genomic care. Keywords: rare disease genomics, genome sequencing, automated reanalysis, variant prioritization, Talos, diagnostic yield

20
Artificial Intelligence-Based Chatbots in Genetic Counseling Practice: Current Uptake, Utilization, and Perspectives

Daley, N.; Griswold, A.; Moreno, L.; Floyd, A.; Duong, D.; Solomon, B. D.; Waikel, R. L.

2026-05-24 genetic and genomic medicine 10.64898/2026.05.21.26353789 medRxiv
Top 0.3%
2.0%
Show abstract

AI-driven chatbots have been utilized in healthcare to automate administrative tasks, improve patient education, and expand access to medical information; however, their role in genetic counseling remains underexplored. To investigate the adoption, perceptions, and potential utility of AI-based chatbots in genetic counseling practice, 217 genetic counselors and genetic counseling students from across North America were surveyed regarding chatbot usage, confidence in their application, and perceived benefits and limitations. While most participants (166/217; 76.5%) reported using general AI chatbots outside of clinical settings, far fewer (18/204; 8.8%) reported using or recommending clinical genetics chatbots in clinical practice. For those that used clinical genetics chatbots, the primary purpose was for communication with at-risk family members (11/18; 61.1%) and patient education (10/18; 55.6%). Confidence in chatbot technology varied, with highest confidence in gathering family history information (81/199; 40.7%) and lowest confidence in their ability to disclose variants of uncertain significance or positive genetic testing results (5/199; 2.5%). The greatest perceived benefits included reducing repetitive tasks (165/195, 84.6%) and allowing for time for other tasks (141/195; 72.3%), while major concerns revolved around patient comprehension (167/195; 85.6%) and having accurate, up-to-date information (145/195; 74.4%). Despite some concern about AI replacing human counselors, most participants reported they felt there was potential for chatbots to enhance workflow efficiency (128/195; 65.6%) if properly integrated and regulated. Limited AI training was identified as a barrier to adoption (16/195; 8.2% received training), highlighting a need for structured education on AI applications in genetic counseling. These findings suggest that AI chatbots hold promise as supplementary tools, but significant challenges must be addressed before widespread implementation in genetic counseling practice.